A possible low-level explanation of "temporal dynamics of brightness induction and White's illusion"

Subhajit Karmakar* and Sandip Sarkar†

Microelectronics Division, Saha Institute of Nuclear Physics 
Kolkata- 700064, INDIA 



Abstract

Based on physiological observations of time-dependent orientation selectivity in cells of the macaque
primary visual cortex, together with psychophysical studies on the tuning of orientation detectors in
human vision, we suggest that the time dependence of brightness perception can be accommodated through the
time evolution of the cortical contribution to the orientation tuning of the ODoG filter responses. A set of
Difference of Gaussians functions has been used to mimic the time dependence of orientation tuning. The
tuning of orientation preference and its inversion at a later time have been considered in explaining
qualitatively the temporal dynamics of brightness perception observed in "Brief presentations reveal the
temporal dynamics of brightness induction and White's illusion" for 58 and 82 ms of stimulus exposure.



Introduction

Psychophysical studies on human observers suggest that our visual system perceives the luminance of a target
region depending upon the luminance of its surround. In a square grating consisting of alternating black
and gray stripes, the gray stripes look brighter than the same gray stripes bordered by white stripes.
This is an example of brightness induction, which produces a brightness contrast effect.




It has been observed by De Valois et al. (1986) that the brightness modulation of a static gray patch caused by
the luminance modulation of its large surround depends on the temporal frequency of the luminance modulation.
At a lower temporal frequency (below 2.5 Hz) the brightness modulation is clearly perceived, but as the
temporal frequency increases the effect of the modulation vanishes and the central patch appears
to be a static gray. Based on this finding, Rossi and Paradiso (1996) explored whether the temporal cut-off
of brightness modulation depends on the spatial scale of the stimulus. Modulating the luminance of every
other stripe of a square grating and keeping the intervening stripes at a constant luminance, they
observed that the temporal cut-off for perceiving brightness modulation on the static gray stripes decreases
with decreasing spatial frequency of the square grating. The authors concluded that the process controlling
the brightness change due to induction is slower than the process involved in the brightness change due to
direct luminance modulation. The slower process is supposed to be mediated by a filling-in mechanism, because
the luminance contrast signal, which appears at the edges through a faster mechanism, travels with a finite
speed to influence the brightness of the neighbouring uniform region. Therefore, a wider stripe of the square
grating will take longer to be filled in than a thinner one. The speed of filling-in comes out to be
140-180°/s when estimated from the phase measurements in the same experiment.

Filling-in can play a crucial role (Rossi & Paradiso, 1996) in producing the temporal limit of brightness
induction in square-wave gratings as well as in the achromatic Craik-O'Brien-Cornsweet (COC) effect
(Davey et al., 1998), but it fails to explain the temporal limit of the chromatic version of the COC effect
(Devinck et al., 2007). The temporal cut-off frequency for perceiving the chromatic COC illusion decreases
with increasing spatial frequency. Moreover, on increasing the spatial frequency above 0.02 c/deg, they
observed that the temporal modulation cut-off for the achromatic grating followed the shape of the human
achromatic Contrast Sensitivity Function (CSF), which is inconsistent with the filling-in theory.

If brightness induction is mediated by filling-in, then it should be a slow process. Revisiting



* Email: subhajit.karmakar@saha.ernet.in (Corresponding author)
† Email: sandip.sarkar@saha.ac.in



the idea, Robinson & de Sa (2008) explored whether the strength of brightness induction decreases as the
exposure time of the stimulus is made shorter and shorter, and at what exposure time the illusion
disappears. The limit was expected to be different for different spatial frequencies, as observed in the
earlier experiments. They replaced the modulated inducing stripes of the square grating with static ones.
The whole grating is displayed for a short exposure time. Immediately afterwards, a noise mask of the same
horizontal frequency is displayed for a comparatively longer time to stop further processing initiated by the
preceding stimulus. Subjects were asked to match the brightness of a particular grating stripe of luminance
either 31 or 72 cd/m², bordered by stripes of either 12 or 102 cd/m².

It is observed that human observers can perceive brightness induction (brightness contrast) with a
presentation of the stimuli as brief as 58 ms, irrespective of their spatial frequencies. Contrary to their
expectation, the results of their experiments 1 and 2 show that the induced brightness on the target stripe
of the square grating is maximum for the shortest exposure time (58 ms), and its strength is reduced with
prolonged exposure (82 ms and more). The difference in matching luminance of the same target stripe appearing
with different bordering stripes (12 and 102 cd/m²) also decreases with exposure time, resulting in a decrease
in illusion strength. White's illusion (White, 1979) is also perceived for a short exposure (82 ms) but, in
contrast to brightness induction, its strength increases if more exposure time is allowed.

Though the speed of filling-in is debated by many researchers, including Robinson & de Sa (2008) themselves,
the widest grating (10.6°) in their experiment is supposed to be filled in within 29-37 ms at a speed
of 140-180°/s estimated from the work of Rossi & Paradiso (1996). The estimated filling-in time does not
include any other temporal delay. Such fast filling-in may be consistent with the initial brightness
perception for the shortest exposure length (58 ms) if the signal delay from the retinal ganglion cells to V1
is considered. But the authors have also argued that the observed temporal dynamics of brightness induction
cannot be explained in the light of filling-in, because the speed of filling-in could be too fast to limit
the speed of brightness perception.
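
As a rough check on the quoted filling-in time, one can assume (this is an assumption, not stated explicitly in the text) that the contrast signal spreads inward from both edges of a stripe of width w = 10.6°, so that it only has to cover half the stripe width at speed v:

```latex
t_{\mathrm{fill}} = \frac{w/2}{v}, \qquad
\frac{5.3^\circ}{180^\circ/\mathrm{s}} \approx 29\ \mathrm{ms}, \qquad
\frac{5.3^\circ}{140^\circ/\mathrm{s}} \approx 38\ \mathrm{ms}.
```

Under that assumption the result is close to the 29-37 ms range given above. Filling the full 10.6° from a single edge would instead take roughly 59-76 ms.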

However, the temporal frequency cut-offs obtained for different spatial frequencies while observing brightness
modulation in the achromatic COC effect (Devinck et al., 2007) agree well with the shape of the human
achromatic Contrast Sensitivity Function (CSF). Therefore, the authors have suggested that the temporal
dynamics of the COC illusion may arise from spatio-temporal filtering of the stimulus by the human luminance
system, instead of being mediated by filling-in. On the other hand, under steady viewing conditions,
psychophysical measurements of brightness contrast and assimilation can be modelled (Blakeslee & McCourt,
1999; Robinson et al., 2007) by spatial filtering followed by RMS response normalization (ODoG / FLODoG).
According to Robinson et al. (2008), these multi-scale models do not include any temporal dependence on
spatial scale, so they are compatible with fast brightness perception for stimuli of any spatial scale. But
there is no explicit time dependence in these models; therefore, they cannot be used to predict the time
course of the brightness illusion as it is observed in their experiment. One possible way to incorporate time
in the ODoG/FLODoG models is to consider that spatial filtering and response normalization are completed at
different instants (Robinson & de Sa, 2008). The onset of the noise mask after a short presentation of the
stimulus is thus supposed to interfere with the ongoing response normalization process if it is not
completed. Therefore, the incompleteness of the processing is probably reflected in the induced brightness
perceived for different lengths of exposure.

Models like FLODoG and ODoG are linear while computing a weighted sum of the intensity distribution
through spatial filtering, but nonlinear in performing response normalization. This property is very often
observed in the response of simple cells in the primary visual cortex of the macaque (Carandini et al., 1997)
when visual information falls on their receptive fields. The nonlinearity in a cell's response can be
accounted for if shunting or divisive inhibition among a large number of cortical cells is considered. Thus,
in the visual network, intracortical feedback, which possibly provides shunting inhibition, results in
response normalization in their model.

(I) Intracortical feedback and orientation tuning dynamics in V1

Orientation tuning is an emergent property of the cells in the primary visual cortex (Hubel & Wiesel, 1962).
The orientation selectivity of cortical simple cells may arise from the geometrical alignment of the LGN
receptive fields. The sharpness of the orientation tuning will depend upon the aspect ratio of the
convergent feedforward structure. However, weakly converging thalamocortical input, either alone or along
with cortical inhibition, cannot explain the contrast-invariant orientation tuning (Somers et al., 1995) of
the cells in V1.

In addition, experimental studies by Ringach et al. (1997) have demonstrated that orientation tuning in the V1
of macaque evolves with time. The broadly tuned neurons in layers 4Cα and 4Cβ, which receive direct input
from the LGN, do not change their orientation preference over time, though the overall response is reduced. On
the other hand, neurons in the output layers 4B, 2, 3, 5 or 6 change their preferred orientations with time.
For example, the orientation distribution of a typical neuron in 4B shows a narrow peak around its preferred
orientation at 53 ms from the onset of that particular orientation and produces a Mexican hat distribution
around 59 ms. Finally, at 71 ms, it exhibits broader tuning around an orientation orthogonal to the earlier
one.

Though there has been a long debate about, and several modifications to, the feedforward model of orientation
tuning (Teich & Qian, 2006), the temporal dynamics of orientation tuning observed in cells of V1 may be
accounted for by recurrent cortical excitation or inhibition (Ringach et al., 1997). Recurrent network models
are even considered to be consistent with the observations on orientation plasticity (Dragoi et al., 2002) in
cortical cells (Teich & Qian, 2006).



This recurrent model considers intracortical feedback crucial for sharpening the orientation selectivity of a
cortical cell that receives a weak feedforward orientation bias from converging LGN input. According to
Somers et al. (1995), in the orientation domain, a balance between narrowly tuned intracortical excitation and
broadly tuned intracortical inhibition can produce contrast-invariant cortical orientation tuning from the
weakly tuned thalamocortical excitation in cat V1.

(II) Relationship between orientation dynamics and psychophysical observations

Following the earlier study (Ringach et al., 1997) on orientation tuning dynamics in macaque V1, Ringach
(1998) conducted a psychophysical measurement on human observers with a sequence of flashed sinusoidal
gratings of random orientations and spatial phases. It was observed that the orientation detectors in the
human visual system exhibit a distribution of 'Mexican hat' shape, which resembles the orientation
distribution of some single neurons in layers 4B, 2+3 and 5 of the macaque primary visual cortex
(Ringach et al., 1997; Ringach, 1998). The author inferred from these findings that lateral inhibition in the
orientation domain, which is thought to be responsible for tuning dynamics in the V1 of cat and monkey, is
probably present in the human visual cortex. Even the orientation inversion in the probability distribution
observed for some of the cells (Fig. 2, Ringach et al., 1997) is supported by the psychophysical study of
cross-orientation interaction in human vision by Roeber et al. (2008). Following a methodology similar to
that of Ringach (1998), they found that when the inter-stimulus interval is 100 ms, aligned gratings result
in suppression but misaligned gratings favour facilitation.

(III) Time course of brightness coding in V1

Moreover, from a study of the C1 component of the visual event-related potential (ERP) in human observers
perceiving White's illusion, McCourt & Foxe (2004) reported that the perceived brightness difference in
White's effect is reflected in the early phase (50-80 ms after the onset of the stimulus) of C1. This early
phase represents the initial activation of areas like V1 in the striate cortex.

In the following sections, we investigate, with the stimuli used by Robinson & de Sa (2008), whether the
time-dependent intracortical feedback which generates the dynamics of orientation tuning in V1 can be used
along with static ODoG filters to predict qualitatively the nature of brightness perception over time.



Possible low-level model of temporal dynamics of brightness induction 

Psychophysical observations together with physiological measurements of orientation selectivity suggest that
the time evolution of the orientation distribution of cells in V1 might have an effect on the time course of
brightness perception. The dynamics of orientation tuning thus indicates that a computational model of V1
should not only comprise spatial filtering by a bank of static oriented filters but also include a
contribution for dynamical response facilitation or suppression.

The multi-scale orientation filtering (ODoG/FLODoG) which has been used for successful brightness prediction
is supposed to mimic the visual processing of areas like V1. So, the same spatial filters can be used in the
prediction of brightness perception over time if intracortical feedback is associated with them
(McCourt & Foxe, 2004).



It can be assumed that the observed probability distribution of orientation selectivity of the cells in V1
(Ringach et al., 1997) represents the orientation impulse response at a particular instant. If the
orientation detectors in our visual system are identical and connected with each other in a ring fashion, then
the response of the orientation detectors at that particular instant can be computed by a circular
convolution of the orientation impulse response and the input orientation distribution at that moment.
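
The circular convolution over a ring of orientation detectors can be illustrated with a minimal Python sketch. It is only an illustration under stated assumptions: the function name `circular_convolve`, the use of 12 detectors and the toy kernel values are hypothetical and are not taken from the simulation described here.

```python
import numpy as np

def circular_convolve(h, x):
    """Circular convolution of two equal-length 1-D arrays via the FFT.

    h[0] is taken as the kernel value at zero orientation offset.
    """
    h = np.asarray(h, dtype=float)
    x = np.asarray(x, dtype=float)
    if h.ndim != 1 or h.shape != x.shape:
        raise ValueError("h and x must be 1-D arrays of equal length")
    return np.real(np.fft.ifft(np.fft.fft(h) * np.fft.fft(x)))

# Hypothetical example: 12 detectors spaced 15 degrees apart on a ring.
n = 12
x = np.zeros(n)
x[3] = 1.0                              # input concentrated at the 45-degree detector
h = np.zeros(n)
h[0], h[1], h[-1] = 1.0, -0.5, -0.5     # toy kernel: centre excitation, flank inhibition
y = circular_convolve(h, x)             # kernel re-centred on the 45-degree detector
```

Because the detectors form a ring, a kernel placed near 0° wraps around to orientations just below 180° instead of being cut off at the ends of the array.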

If we consider that the initial time delay for the visual signal to travel from the retina to the cells in V1
is about 20 ms, then for the 58 ms exposure the signal arising from the onset of the noise mask will stop
additional processing of the stimulus signal 78 ms after the onset of the stimulus (Robinson & de Sa, 2008).
Therefore, it can be expected that the initial brightness percept of the stimulus formed at 78 ms is due to
the time evolution of the orientation detectors over 78 ms. Similarly, when the exposure time is increased to
82 ms, i.e., at a time delay of 102 ms from the onset of the stimulus, the observed effect will change due to
the change in the orientation impulse response over time. If we look at the time evolution of the orientation
tuning of some cells in V1 (Fig. 2, Ringach et al., 1997), it is found that around 20 ms after the sharply
tuned state, the distribution gets inverted and relatively broader tuning appears around an orientation
orthogonal to the most preferred orientation of the earlier distribution. The temporal dynamics of the
response distribution of the orientation detectors can be implemented in the following way.

(I) Generation of the orientation impulse response



Following the recurrent model of Somers et al. (1995), in our proposition we have considered a balanced
Difference of Gaussians (DoG) to construct the Mexican hat shape of the orientation impulse response at time
T1 from the onset of the stimulus.

$$h(\theta) = \frac{1}{\sqrt{2\pi}\,\sigma_e}\, e^{-\frac{(\theta-\theta_e)^2}{2\sigma_e^2}} - \frac{1}{\sqrt{2\pi}\,\sigma_i}\, e^{-\frac{(\theta-\theta_i)^2}{2\sigma_i^2}} \qquad (1)$$

where $\sigma_e = 7.5^\circ$ and $\sigma_i = 60^\circ$ are chosen to achieve a narrow tuning half-width for the
orientation impulse response. $\theta_e$ and $\theta_i$ are the mean positions of the Gaussians generating the
time-dependent orientation impulse response $h(\theta)$, and for T1 ms of exposure they coincide. For T2 ms of
exposure, in contrast, the Gaussians are centred on two orthogonal orientations, with $\sigma_e = 25^\circ$
and $\sigma_i = 60^\circ$, to achieve slightly broader tuning around the orthogonal orientation compared to
the previous condition. The balance condition keeps the area under the distribution curve constant before and
after tuning. Modelling the inversion of the orientation impulse response with a mean-shifted DoG may not have
a physiological equivalent like the model of Somers et al. (1995), but it can be treated as a computational
manipulation to mimic the physiological observation.
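
The construction of the two impulse responses can be sketched in Python as follows. This is a minimal sketch under several assumptions that are not specified in the text: orientations are sampled every 15° on a ring of period 180°, each Gaussian is evaluated on the shortest angular distance and rescaled to unit sum so that the DoG is balanced on the sampled ring, and at T2 the excitatory lobe is placed at 90° while the inhibitory lobe stays at 0°. The function names are hypothetical.

```python
import numpy as np

def ring_gaussian(theta_deg, mean_deg, sigma_deg, period=180.0):
    """Gaussian of the shortest angular distance on an orientation ring,
    rescaled to unit sum over the sampled orientations (an assumption)."""
    d = (theta_deg - mean_deg + period / 2.0) % period - period / 2.0
    g = np.exp(-0.5 * (d / sigma_deg) ** 2)
    return g / g.sum()

def dog_impulse_response(theta_deg, theta_e, sigma_e, theta_i, sigma_i):
    """Balanced DoG: excitatory lobe minus inhibitory lobe, each with unit sum."""
    return (ring_gaussian(theta_deg, theta_e, sigma_e)
            - ring_gaussian(theta_deg, theta_i, sigma_i))

theta = np.arange(0.0, 180.0, 15.0)      # 12 orientations in [0, 180)

# T1: coincident means, narrow excitation (Mexican hat around 0 degrees).
h_t1 = dog_impulse_response(theta, 0.0, 7.5, 0.0, 60.0)

# T2: orthogonal means (assumed placement), broader excitation around 90 degrees.
h_t2 = dog_impulse_response(theta, 90.0, 25.0, 0.0, 60.0)
```

With these choices `h_t1` peaks at the original orientation, while `h_t2` is negative there and peaks at the orthogonal orientation, which is the inversion the text describes. Both sum to zero, so the balance condition holds on the sampled ring.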

(II) Response of the orientation detectors 

Similar to the local RMS computation in the FLODoG model, the input orientation distribution from the output
of the $j$th scale ODoG filter is computed as the Gaussian-weighted mean response over a region of area
$3\sigma_e^{j} \times 3\sigma_e^{j}$, where $\sigma_e^{j}$ is the standard deviation of the centre Gaussian of
the $j$th scale ODoG filter.

$$\mathrm{Ortn}_j(\theta) = \langle I * \mathrm{ODoG}_j(\theta) \rangle \qquad (2)$$

$$O_j(\theta) = \eta_j\, \mathrm{Ortn}_j(\theta) + \alpha_j\, h(\theta) \otimes \mathrm{Ortn}_j(\theta) \qquad (3)$$

Here $\otimes$ denotes circular convolution over orientation. It can be anticipated from the observations of
Ringach et al. (1997) on the dynamics of orientation selectivity that both the coefficients $\eta$ and
$\alpha$ in our expression can vary with time. Therefore the model's prediction could be different for several
combinations of $\eta$ and $\alpha$. However, in our study, we do not vary either of them with time or scale,
but use a different orientation impulse response $h(\theta)$ at the two instants.
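
Equation (3) can be sketched in Python as below. This is an illustrative sketch only: the function name `detector_response` and the sample values are hypothetical, the FFT is just one way of evaluating the circular convolution, and `h[0]` is assumed to hold the impulse response at zero orientation offset.

```python
import numpy as np

def detector_response(ortn_j, h, eta_j=1.0, alpha_j=1.0):
    """O_j = eta_j * Ortn_j + alpha_j * (h circularly convolved with Ortn_j)."""
    ortn_j = np.asarray(ortn_j, dtype=float)
    h = np.asarray(h, dtype=float)
    if ortn_j.ndim != 1 or ortn_j.shape != h.shape:
        raise ValueError("ortn_j and h must be 1-D arrays of equal length")
    # Cortical (feedback) term: circular convolution over the orientation ring.
    feedback = np.real(np.fft.ifft(np.fft.fft(h) * np.fft.fft(ortn_j)))
    return eta_j * ortn_j + alpha_j * feedback

# Hypothetical input distribution over 12 orientations (15-degree steps).
ortn = np.array([0.9, 0.4, 0.1, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.1, 0.4])

# Sanity case: an impulse kernel simply doubles the input when eta = alpha = 1.
delta = np.zeros(12)
delta[0] = 1.0
doubled = detector_response(ortn, delta)
```

A balanced (zero-sum) kernel leaves a flat input distribution unchanged, since the feedback term then integrates to zero at every orientation; only orientation-specific structure in the input is facilitated or suppressed.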

Here, we have used the same frequency power law as in the ODoG model of Blakeslee & McCourt (1999) to combine
the information of a single orientation over multiple scales. The exponent of the power function was
selected as 0.01 to approximate the supra-threshold contrast sensitivity of the human visual system.

$$\max |A(\theta)| = \max \Big| \sum_j \beta_j\, O_j(\theta) \Big| \qquad (4)$$

Here $\beta_j = \omega_j^{p}$ are the spatial frequency weight factors, with $p$ the exponent of the power
function given above, and the $\omega_j$ are the spectral modes of the multi-scale ODoG filters.
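
The combination over scales in equation (4) can be sketched in Python as follows. The function names, the array layout (scales along rows, orientations along columns) and the numbers in the example are hypothetical. The example uses an exponent of 0.5 only so that the arithmetic is easy to follow; it is not the value used in the model.

```python
import numpy as np

def combined_response(O, omega, exponent):
    """A(theta) = sum_j beta_j * O_j(theta), with beta_j = omega_j ** exponent.

    O has shape (n_scales, n_orientations); omega has shape (n_scales,).
    """
    O = np.asarray(O, dtype=float)
    beta = np.asarray(omega, dtype=float) ** exponent
    return beta @ O

def max_contrast_response(A):
    """Largest absolute value of A over orientation and the index where it occurs."""
    k = int(np.argmax(np.abs(A)))
    return abs(float(A[k])), k

# Toy example: two scales, three orientations.
O = np.array([[1.0, -2.0, 0.0],
              [0.5, -0.5, -1.0]])
omega = np.array([1.0, 4.0])                      # hypothetical spectral modes
A = combined_response(O, omega, exponent=0.5)     # beta = [1, 2], so A = [2, -3, -2]
peak, index = max_contrast_response(A)            # peak = 3.0 at orientation index 1
```

Taking the maximum of the absolute value means that a strong negative response counts as much as a strong positive one.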

(III) Prediction of brightness from the orientation distribution 

If a simple cell in V1 is excited by a moving grating for a short exposure time, the Post Stimulus Time
Histogram (PSTH) obtained from single-cell recording shows that the firing rate increases to a maximum around
50 ms from the onset of the stimulus and then decays towards a sustained level (Albrecht et al., 2002). In
view of this finding, Robinson & de Sa (2008) suggested that the fastest brightness percept might arise from
the prediction of the elevated firing rate by our visual system. Since our visual system treats 'white' and
'black' with equal status, our proposed model decides the brightness of an induced stripe of the square
grating depending on the maximum contrast response obtained from equation (4) for a very short presentation
time, e.g. 58 ms and 82 ms from the onset of the stimuli.



Simulation 

The stimuli used in our computer simulation are the same as those used in the experiment of
Robinson & de Sa (2008), except for the matching gray patch and its surround. The upper half of each stimulus
image is black and the lower half is filled with the illusory stimulus under study. We have considered a
stimulus size of 1024 px × 1024 px. The stripe width of the thinner grating is taken as 31 px and that of the
wider grating as 340 px. For generating White's stimulus, the height and width of the test patches are taken
as 62 px and 31 px. The oriented spatial filter outputs are generated by the MATLAB code of
Robinson et al. (2007). Orientation filters are generated for [0°, 180°) in steps of 15°. The mean response of
visual information over all scales is evaluated at the middle of the illusory stimuli. Though it has been
mentioned in the earlier section that the window size is proportional to the scale of the Gaussian, we have
fixed the size of the Gaussian window at 256 px × 256 px for estimating the mean response. All $\eta_j$ and
$\alpha_j$ are set to 1 in this study.
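
The grating stimuli described above can be approximated with a short Python sketch. It is not the code used for the simulation (the filter outputs there come from MATLAB). The function name `grating_stimulus`, the vertical stripe orientation, the stripe phase and the use of 0 for the black upper half are all assumptions made for illustration.

```python
import numpy as np

def grating_stimulus(size=1024, stripe_px=31, target_lum=31.0, inducer_lum=12.0,
                     background_lum=0.0):
    """Square grating in the lower half of a size x size image, black upper half.

    Vertical stripes alternate between target_lum and inducer_lum (cd/m^2).
    Stripe orientation, phase and background value are assumptions.
    """
    img = np.full((size, size), background_lum, dtype=float)
    stripe_index = np.arange(size) // stripe_px          # stripe number of each column
    row = np.where(stripe_index % 2 == 0, target_lum, inducer_lum)
    img[size // 2:, :] = row                             # broadcast over the lower rows
    return img

thin = grating_stimulus(stripe_px=31)     # thin grating: 31 px stripes
wide = grating_stimulus(stripe_px=340)    # wide grating: 340 px stripes
```

The other luminance pairs (31 or 72 cd/m² targets with 12 or 102 cd/m² inducers) follow by changing `target_lum` and `inducer_lum`.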



Figure 1: Illusory stimuli. Mean response is evaluated at the middle of the stimuli.
(a) Thin square grating: 31 cd/m² bordered by 12 cd/m²
(b) Thin square grating: 31 cd/m² bordered by 102 cd/m²
(c) Thin square grating: 72 cd/m² bordered by 12 cd/m²
(d) Thin square grating: 72 cd/m² bordered by 102 cd/m²
(e) Wide square grating: 31 cd/m² bordered by 12 cd/m²
(f) Wide square grating: 31 cd/m² bordered by 102 cd/m²
(g) Wide square grating: 72 cd/m² bordered by 12 cd/m²
(h) Wide square grating: 72 cd/m² bordered by 102 cd/m²
(i) White stimulus: gray test on black stripe
(j) White stimulus: gray test on white stripe



Results and Discussion 



With the propositions made in the earlier section, our observations from the computer simulation can be
grouped into the following categories, which can be related to the experiments of Robinson & de Sa (2008).
Several categories are considered because there is variability in the subjects' judgments of brightness for
the same type of stimuli.

(I) Visual information over all spatial scales is supposed to be present at both instants, and the prediction
is made from the mean response computed over all the scales with the Gaussian window.

Brightness Induction: Figure 2(a) shows the predicted response at two different exposure lengths (58 ms and
82 ms) for the thin square grating with an induced stripe of luminance 31 cd/m² bordered by inducing stripes
of luminance 12 cd/m² (dotted line with squares) and 102 cd/m² (dotted line with diamonds), respectively.
Similarly, the solid lines with square and diamond symbols in the same figure represent the predicted response
for an induced stripe of luminance 72 cd/m².

At both instants, the target stripe appears brighter when it is induced by the stripe of lower luminance
(12 cd/m²) than by that of higher luminance (102 cd/m²). The vertical distance between the dotted lines or the
solid lines is reduced at the later instant, i.e., with increasing exposure length the difference between
their predicted responses (illusion strength) is reduced. This is also the observation of
Robinson & de Sa (2008).

The predicted response for the wider grating (Fig. 2b) follows a behaviour similar to that for the thinner
grating.

White Effect: The predicted response for a gray test region placed on a white stripe (dotted line) and for the
same region on a black stripe (solid line) of the square grating is depicted in Figure 2(c) for two different
lengths (58 ms and 82 ms) of stimulus exposure. If the response determines the perceived brightness, then the
test patch positioned on a white stripe and flanked by black stripes on either side will be judged brighter
than the similar one placed on a black stripe and flanked by two white stripes, for the shortest length of
exposure (58 ms). This is opposite to White's illusion (White, 1979) as perceived by human observers. However,
the observers who participated in the experiment of Robinson & de Sa (2008) found it difficult to see the test
patch in the shortest time interval (58 ms). For a relatively longer exposure (82 ms), the inverted
orientation impulse response produces strong response suppression at the preferred orientation but
facilitation at the orientation orthogonal to it (Fig. 5i & j). The predicted response of the test patches
indicates that if the visual system follows the same rule as at the earlier instant, observers might not be
able to perceive White's illusion. Though the difference in predicted response (Fig. 2c) is very small, the
gray test patch on the white stripe still appears brighter than the identical one placed on the black stripe.

[Figure 2 panels: (a) Thin square grating, (b) Wide square grating, (c) White's stimulus; horizontal axis: exposure length, 58 and 82 ms]

Figure 2: The horizontal axis represents two different exposure lengths in ms. (a)-(b) Predicted response of
the thinner (1° stripe width) and wider (10.6° stripe width) square gratings at two different exposure
lengths. The dotted line with square symbols represents the prediction for a target stripe of luminance
31 cd/m² when induced by a stripe of lower luminance (12 cd/m²). Diamond symbols on the same line style
indicate the prediction for the same target stripe when induced by a stripe of higher luminance (102 cd/m²).
Similarly, the solid lines with the same symbols represent the predictions for a target stripe of luminance
72 cd/m². (c) Predicted response of the gray test patches of White's stimulus at two different exposure
times. The dotted line with square symbols represents the prediction for the gray test patch placed on a white
stripe, whereas the solid line with the same symbols represents the prediction for the same test patch placed
on a black stripe. Orientation distributions for the above-mentioned cases are depicted in Fig. 5.

(II) There are seven spatial scales in the ODoG model. For 82 ms of stimulus exposure, the mean response of
visual information over the relatively higher spatial scale filters (the three largest spatial scales of ODoG)
is computed with a Gaussian window of the smallest spatial scale among them. The predictions for the wide and
thin gratings (Fig. 3a & b) do not differ from those in the previous condition. The illusion strength
decreases with increasing exposure length. On the other hand, in White's stimulus, the gray test patch placed
on the white stripe of the grating is predicted to appear darker (Fig. 3c) than the same gray test patch
positioned on the black stripe at the later instant. Thus, the use of a smaller-scale window function in the
prediction could be relevant to the observation (Fig. 7) in the experiment of Robinson & de Sa (2008), because
subjects might be trying to see the test patches clearly to judge the brightness they perceived.



[Figure 3 panels: (a) Thin square grating, (b) Wide square grating, (c) White's stimulus; horizontal axis: exposure length, 58 and 82 ms]

Figure 3: The horizontal axis represents two different exposure lengths in ms. (a)-(b) Predicted response of
the thinner (1° stripe width) and wider (10.6° stripe width) square gratings at two different exposure
lengths. The dotted line with square symbols represents the prediction for a target stripe of luminance
31 cd/m² when induced by a stripe of lower luminance (12 cd/m²). Diamond symbols on the same line style
indicate the prediction for the same target stripe when induced by a stripe of higher luminance (102 cd/m²).
Similarly, the solid lines with the same symbols represent the predictions for a target stripe of luminance
72 cd/m². (c) Predicted response of the gray test patches of White's stimulus at two different exposure
times. The dotted line with square symbols represents the prediction for the gray test patch placed on a white
stripe, whereas the solid line with the same symbols represents the prediction for the same test patch placed
on a black stripe. Orientation distributions for the above-mentioned cases are depicted in Fig. 6.



(III) In the above two cases, the mean response is evaluated at the middle of the illusory stimuli. If the
point of observation is shifted towards the interface of the black region and the illusory stimulus, the
model's prediction differs. This is because of the significant contrast response produced by the larger
spatial scale filters. The slope of the response curve of the target stripe of luminance 31 cd/m² of the
square grating does not change (Fig. 4a & b) from that observed in the earlier cases. But the slope of the
response curve for the target stripe of higher luminance (72 cd/m²) is reversed when it is induced by the
bordering stripe of luminance 102 cd/m². A similar observation is also reported by 2 out of 4 observers in the
experiment of Robinson & de Sa (Fig. 4, 2008). In contrast, the response curves (Fig. 4) for the wide grating
cross each other at the shortest exposure time, which does not agree with the observation in their experiment.




[Figure 4 panels: (a) Thin square grating, (b) Wide square grating; horizontal axis: exposure length, 58 and 82 ms]

Figure 4: The horizontal axis represents two different exposure lengths in ms. (a)-(b) Predicted response of
the thinner (1° stripe width) and wider (10.6° stripe width) square gratings at two different exposure
lengths. The dotted line with square symbols represents the prediction for a target stripe of luminance
31 cd/m² when induced by a stripe of lower luminance (12 cd/m²). Diamond symbols on the same line style
indicate the prediction for the same target stripe when induced by a stripe of higher luminance (102 cd/m²).
Similarly, the solid lines with the same symbols represent the predictions for a target stripe of luminance
72 cd/m². Orientation distributions for the above-mentioned cases are depicted in Fig. 7.

Balanced DoG and other possibilities

In our proposition, we have considered only a balanced DoG as the cortical contribution to generate the
orientation distribution for different lengths of exposure. As a result, the area under the distribution
curve remains constant before and after tuning. The time evolution may be explored with the use of an
unbalanced DoG (Pugh et al., 2000), in which inhibition is stronger than excitation. Another possibility is to
use a family of DoGs to get different orientation distributions at different times.

While predicting the Mexican hat shape of the orientation selectivity of cells in macaque V1,
Ringach et al. (2003) used the von Mises distribution to approximate cortical excitation and inhibition. This
distribution function can also be used for predicting the orientation impulse response.
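
As an illustration of the von Mises alternative, the Python sketch below builds a zero-area difference of two von Mises densities on the 180° orientation circle. It is a hypothetical construction and is not taken from Ringach et al. (2003) or from the present model: the function names and the concentration values `kappa_e = 8.0` and `kappa_i = 0.5` are assumptions chosen only to give a narrow excitatory and a broad inhibitory lobe.

```python
import numpy as np

def von_mises_orientation(theta_deg, mean_deg, kappa):
    """von Mises density on the 180-degree orientation circle.

    f(theta) = exp(kappa * cos(2 * (theta - mean))) / (pi * I0(kappa)),
    which integrates to 1 over any interval of pi radians.
    """
    delta = np.deg2rad(np.asarray(theta_deg, dtype=float) - mean_deg)
    return np.exp(kappa * np.cos(2.0 * delta)) / (np.pi * np.i0(kappa))

def von_mises_difference(theta_deg, mean_deg, kappa_e, kappa_i):
    """Narrow (kappa_e) minus broad (kappa_i) lobe; zero net area by construction."""
    return (von_mises_orientation(theta_deg, mean_deg, kappa_e)
            - von_mises_orientation(theta_deg, mean_deg, kappa_i))

theta = np.arange(0.0, 180.0, 1.0)        # 1-degree sampling of [0, 180)
h_vm = von_mises_difference(theta, mean_deg=0.0, kappa_e=8.0, kappa_i=0.5)
```

Unlike a Gaussian, the von Mises density is periodic by construction, so no wrapping or truncation is needed on the orientation ring.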

Beyond 82 ms

Robinson et al. (2008) have investigated the dynamics of brightness perception for exposure lengths longer
than 82 ms. It is observed from the matched luminance (Fig. 3 & 4, Robinson et al. 2008) that the induction
strength gradually decreases with increasing exposure time and the perceived brightness tends towards the
actual luminance of the target stripe of the square grating. In contrast, luminance matching in White's
stimulus shows that the illusion strength increases with increasing exposure length and the perceived
brightness of the gray test patch shifts away from its actual luminance (Fig. 7, Robinson & de Sa, 2008).
Though not exactly similar to their experiment, and despite the sluggishness of the fMRI signal, fMRI studies
of a contrast detection task (Ress & Heeger, 2003) in human observers also indicate that activity in early
visual areas like V1 may correspond to two phases of response: an immediate response due to the stimulus and a
later feedback signal (after 100 ms) generating the subjects' visual percepts. Therefore, on longer stimulus
exposure, the feedback from hierarchical visual areas can modify the brightness perception of the stimulus.

It can even be anticipated that the brightness matching technique (Robinson & de Sa, 2008), in which the
target stripe is looked at several times during a trial of a few seconds, can exhibit the influence of the
feedback signal responsible for the percept on the instantaneous stimulus response. This could be one possible
reason for the minimal difference between the matched brightness of a square grating for 58 ms and 82 ms of
exposure in their experiment (Robinson et al. 2008).



Conclusion 

We have shown through modelling that the time dependence of brightness perception can be accommodated through the time
evolution of the cortical contribution to the orientation tuning of the oriented difference of Gaussians (ODoG) filter
responses. Orientation tuning has been implemented using a set of Difference of Gaussians functions. Our results
qualitatively explain the temporal dynamics of brightness perception observed by Robinson & de Sa (2008) for 58 and 82 ms
of stimulus exposure. When the mean response is computed for the three largest spatial scales of ODoG with a Gaussian
window of the smallest spatial scale among them, the model's predictions of brightness induction (for 58 and 82 ms of
exposure) and of White's illusion (for 82 ms of exposure) match the psychophysical observations. If instead the mean
response is computed with Gaussian windows of all scales for 58 and 82 ms of stimulus exposure, the model successfully
predicts the time evolution of brightness induction in the square grating, but its prediction for White's illusion is
opposite to the observed one. When the point of observation is shifted towards the interface of the black region and the
illusory stimulus, the prediction for the target stripe of higher luminance (72 cd/m²) of the wide grating does not
corroborate the psychophysical observation.



Acknowledgements

The authors are thankful to Alan E. Robinson for useful discussions on their work and for providing the MATLAB code of
the FLODOG model. The authors are also thankful to Mr. Shaibal Saha for discussions on the reverse correlation experiment.

References 

Albrecht, D. G., Geisler, W. S., Frazor, R. A., & Crane, A. M. (2002). Visual cortex neurons of monkeys and cats: 
Temporal dynamics of the contrast response function. Journal of Neurophysiology, 88, 888-913. 

Blakeslee, B. & McCourt, M. E. (1999). A multiscale spatial filtering account of the White effect, simultaneous brightness
contrast and grating induction. Vision Research, 39, 4361-4377.

Carandini, M., Heeger, D. J., & Movshon, J. A. (1997). Linearity and normalization in simple cells of macaque primary 
visual cortex. The Journal of Neuroscience, 17, 8621-8644. 

Davey, M. P., Maddess, T., & Srinivasan, M. V. (1998). The spatiotemporal properties of the Craik-O'Brien-Cornsweet
effect are consistent with "filling-in". Vision Research, 38, 2037-2046.

De Valois, R. L., Webster, M. A., De Valois, K. K., & Lingelbach, B. (1986). Temporal properties of brightness and color
induction. Vision Research, 26, 887-897.

Devinck, F., Hansen, T., & Gegenfurtner, K. R. (2007). Temporal properties of the chromatic and achromatic
Craik-O'Brien-Cornsweet effect. Vision Research, 47, 3385-3393.

Dragoi, V., Sharma, J., Miller, E. K., & Sur, M. (2002). Dynamics of neuronal sensitivity in visual cortex and local 
feature discrimination. Nature Neuroscience, 5, 883-891. 

Hubel, D. H. & Wiesel, T. N. (1962). Receptive fields, binocular interaction and functional architecture in the cat's visual
cortex. Journal of Physiology, 160, 106-154.

McCourt, M. E. & Foxe, J. J. (2004). Brightening prospects for early cortical coding of perceived luminance: A high- 
density electrical mapping study. Neuroreport, 15, 49-56. 

Pugh, M. C., Ringach, D. L., Shapley, R., & Shelley, M. J. (2000). Computational modeling of orientation tuning
dynamics in monkey primary visual cortex. Journal of Computational Neuroscience, 8, 143-159. 

Ress, D. & Heeger, D. J. (2003). Neuronal correlates of perception in early visual cortex. Nature Neuroscience, 6, 414.

Ringach, D. L. (1998). Tuning of orientation detectors in human vision. Vision Research, 38, 963-972. 

Ringach, D. L., Hawken, M. J., & Shapley, R. (1997). Dynamics of orientation tuning in macaque primary visual cortex. 
Nature, 387, 281-284. 

Ringach, D. L., Hawken, M. J., & Shapley, R. (2003). Dynamics of orientation tuning in macaque V1: The role of global
and tuned suppression. Journal of Neurophysiology, 90, 342-352.

Robinson, A. E. & de Sa, V. R. (2008). Brief presentations reveal the temporal dynamics of brightness induction and
White's illusion. Vision Research, 48, 2370-2381.

Robinson, A. E., Hammon, P. S., & de Sa, V. R. (2007). Explaining brightness illusions using spatial filtering and local 
response normalization. Vision Research, 47, 1631-1644. 

Roeber, U., Wong, Y. M. E., & Freeman, W. A. (2008). Cross-orientation interactions in human vision. Journal of 
Vision, 15, 1-11. 

Rossi, A. F. & Paradiso, M. A. (1996). Temporal limits of brightness induction and mechanisms of brightness perception. 
Vision Research, 36, 1391-1398. 

Somers, D. C., Nelson, S. B., & Sur, M. (1995). An emergent model of orientation selectivity in cat visual cortical simple
cells. The Journal of Neuroscience, 15, 5448-5465. 

Teich, F. A. & Qian, N. (2006). Comparison among some models of orientation selectivity. Journal of Neurophysiology,
96, 404-419. 

White, M. (1979). A new effect of pattern on perceived lightness. Perception, 8, 413-416. 







Figure 5: The solid line represents the orientation distribution without intracortical feedback. The dashed line with
diamond symbols represents the orientation distribution when the orientation impulse response enhances orientation
preference. The dashed line with square symbols represents the orientation distribution when the orientation impulse
response is inverted with respect to the earlier one.
Panels (horizontal axis: Orientations): (a) Thin square grating: 31 cd/m² bordered by 12 cd/m²; (b) Thin square grating:
31 cd/m² bordered by 102 cd/m²; (c) Thin square grating: 72 cd/m² bordered by 12 cd/m²; (d) Thin square grating: 72 cd/m²
bordered by 102 cd/m²; (e) Wide square grating: 31 cd/m² bordered by 12 cd/m²; (f) Wide square grating: 31 cd/m² bordered
by 102 cd/m²; (g) Wide square grating: 72 cd/m² bordered by 12 cd/m²; (h) Wide square grating: 72 cd/m² bordered by
102 cd/m²; (i) White Effect: gray test on black stripe; (j) White Effect: gray test on white stripe.






Figure 6: The solid line represents the orientation distribution without intracortical feedback. The dashed line with
diamond symbols represents the orientation distribution when the orientation impulse response enhances orientation
preference. The dashed line with square symbols represents the orientation distribution when the orientation impulse
response is inverted with respect to the earlier one.
Panels (horizontal axis: Orientations): (a) Thin square grating: 31 cd/m² bordered by 12 cd/m²; (b) Thin square grating:
31 cd/m² bordered by 102 cd/m²; (c) Thin square grating: 72 cd/m² bordered by 12 cd/m²; (d) Thin square grating: 72 cd/m²
bordered by 102 cd/m²; (e) Wide square grating: 31 cd/m² bordered by 12 cd/m²; (f) Wide square grating: 31 cd/m² bordered
by 102 cd/m²; (g) Wide square grating: 72 cd/m² bordered by 12 cd/m²; (h) Wide square grating: 72 cd/m² bordered by
102 cd/m²; (i) White Effect: gray test on black stripe; (j) White Effect: gray test on white stripe.













Figure 7: The solid line represents the orientation distribution without intracortical feedback. The dashed line with
diamond symbols represents the orientation distribution when the orientation impulse response enhances orientation
preference. The dashed line with square symbols represents the orientation distribution when the orientation impulse
response is inverted with respect to the earlier one.
Panels (horizontal axis: Orientations): (a) Thin square grating: 31 cd/m² bordered by 12 cd/m²; (b) Thin square grating:
31 cd/m² bordered by 102 cd/m²; (c) Thin square grating: 72 cd/m² bordered by 12 cd/m²; (d) Thin square grating: 72 cd/m²
bordered by 102 cd/m²; (e) Wide square grating: 31 cd/m² bordered by 12 cd/m²; (f) Wide square grating: 31 cd/m² bordered
by 102 cd/m²; (g) Wide square grating: 72 cd/m² bordered by 12 cd/m²; (h) Wide square grating: 72 cd/m² bordered by
102 cd/m².






